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Abstract — Coding in tlie projective space has received recently 
a lot of attention due to its application in network coding. 
Reduced row echelon form of the linear subspaces and Ferrers 
diagram can play a key role for solving coding problems in 
the projective space. In this paper we propose a method to 
design error-correcting codes in the projective space. We use 
a multilevel approach to design our codes. First, we select a 
constant weight code. Each codeword defines a skeleton of a 
basis for a subspace in reduced row echelon form. This skeleton 
contains a Ferrers diagram on which we design a rank-metric 
code. Each such rank-metric code is lifted to a constant dimension 
code. The union of these codes is our final constant dimension 
code. In particular the codes constructed recently by Koetter and 
Kschischang are a subset of our codes. The rank-metric codes 
used for this construction form a new class of rank-metric codes. 
We present a decoding algorithm to the constructed codes in 
the projective space. The efficiency of the decoding depends on 
the efficiency of the decoding for the constant weight codes and 
the rank-metric codes. Finally, we use puncturing on our final 
constant dimension codes to obtain large codes in the projective 
space which are not constant dimension. 

Index Terms — constant dimension codes, constant weight 
codes, reduced row echelon form, Ferrers diagram, identifying 
vector, network coding, projective space codes, puncturing, rank- 
metric codes. 



I. Introduction 

The projective space of order n over the finite field ¥q, de- 
noted Vq^n), is the set of all subspaces of the vector space F" 
Given a nonnegative integer k < n, the set of all subspaces of 
F" that have dimension k is known as a Grassmannian, and 
usually denoted by Qq{n, k). Thus, Vqin) = Uo<fe<n ^?("' ^)- 
It turns out that the natural measure of distance in Vq{n) is 
given by 



ds{Uy) =^ dim U + AimV -2 dim(C/ nV) 

for all U,V(^Vq{n). It is well known (cf.[l], [2]) that the 
function above is a metric; thus both 'Pq{n) and Qq{n^ k) can be 
regarded as metric spaces. Given a metric space, one can 
define codes. We say that CCVq[n) is an {n,M,d)q code 
in projective space if |C| = M and ds{U,V) > d for all 
U,V € C If an {n,M,d)q code C is contained in Qq(n,k) 
for some k, we say that C is an (n, M, d, k)q constant 
dimension code. The [n, M, d)q, respectively (n, M, d, k)q, 
codes in projective space are akin to the familiar codes in 
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the Hamming space, respectively (constant-weight) codes in 
the Johnson space, where the Hamming distance serves as the 
metric. 

Koetter and Kschischang [2] showed that codes in Vq{n) 
are precisely what is needed for error-correction in random 
network coding: an (n, M, d)q code can correct any t packet 
errors (the packet can be overwritten), which is equivalent to 
t insertions and t deletions of dimensions in the transmitted 
subspace, and any p packet erasures introduced (adversarially) 
anywhere in the network as long as 4i + 2/3 < d (see [3] 
for more details). This is the motivation to explore error- 
correcting codes in Vq{n) [4], [5], [6], [7], [8], [9], [10], [11]. 
Koetter and Kschischang [2] gave a Singleton like upper bound 
on the size of such codes and a Reed-Solomon like code 
which asymptotically attains this bound. Silva, Koetter, and 
Kschischang [3] showed how these codes can be described in 
terms of rank-metric codes [12], [13]. The related construction 
is our starting point in this paper. Our goal is to generalize 
this construction in the sense that the codes of Koetter and 
Kschischang will be sub-codes of our codes and all our codes 
can be partitioned into sub-codes, each one of them is a Koetter 
and Kschischang like code. In the process we describe some 
tools that can be useful to handle other coding problems in 
Vqin). We also define a new type of rank-metric codes and 
construct optimal such codes. Our construction for constant 
dimension codes and projective space codes uses a multilevel 
approach. This approach requires a few concepts which will 
be described in the following sections. 

The rest of this paper is organized as follows. In Section Ull 
we define the reduced row echelon form of a fc-dimensional 
subspace and its Ferrers diagram. The reduced row echelon 
form is a standard way to describe a linear subspace. The 
Ferrers diagram is a standard way to describe a partition of 
a given positive integer into positive integers. It appears that 
the Ferrers diagrams can be used to partition the subspaces 
of Vq{n) into equivalence classes [14], [15]. In Section |lll] 
we present rank-metric codes which will be used for our 
multilevel construction. Our new method requires rank-metric 
codes in which some of the entries are forced to be zeroes due 
to constraints given by the Ferrers diagram. We first present 
an upper bound on the size of such codes. We show how to 
construct some rank-metric codes which attain this bound. In 
Section |IV] we describe in details the multilevel construction 
of the constant dimension codes. We start by describing the 
connection of the rank-metric codes to constant dimension 
codes. This connection was observed before in [2], [3], [6], 
[7]. We proceed to describe the multilevel construction. First, 
we select a binary constant weight code C. Each codeword 
of C defines a skeleton of a basis for a subspace in reduced 



row echelon form. This skeleton contains a Ferrers diagram on 
which we design a rank-metric code. Each such rank-metric 
code is lifted to a constant dimension code. The union of these 
codes is our final constant dimension code. We discuss the 
parameters of these codes and also their decoding algorithms. 
In Section |V] we generalize the well-known concept of a 
punctured code for a code in the projective space. Puncturing 
in the projective space is more complicated than its counterpart 
in the Hamming space. The punctured codes of our constant 
dimension codes have larger size than the codes obtained by 
using the multilevel approach described in Section |IV] We 
discuss the parameters of the punctured code and also its 
decoding algorithm. Finally, in Section [Vl] we summarize our 
results and present several problems for further research. 

II. Reduced Echelon Form and Ferrers Diagram 

In this section we give the definitions for two structures 
which are useful in describing a subspace in 'Pq{n). The 
reduced row echelon form is a standard way to describe a 
linear subspace. The Ferrers diagram is a standard way to 
describe a partition of a given positive integer into positive 
integers. 

A matrix is said to be in row echelon form if each nonzero 
row has more leading zeroes than the previous row. 

A k X n matrix with rank k is in reduced row echelon form 
if the following conditions are satisfied. 

• The leading coefficient of a row is always to the right of 
the leading coefficient of the previous row. 

• All leading coefficients are ones. 

• Every leading coefficient is the only nonzero entry in its 
column. 

A fc-dimensional subspace X of F" can be represented by 
a k X n generator matrix whose rows form a basis for X. We 
usually represent a codeword of a projective space code by 
such a matrix. There is exactly one such matrix in reduced 
row echelon form and it will be denoted by E{X). 

Example 1: We consider the 3-dimensional subspace X of 
F2 with the following eight elements. 

1) (0 0) 



2) 


(1 





1 


1 








0) 


3) 


(1 








1 


1 





1) 


4) 


(1 





1 








1 


1) 


5) 


(0 





1 





1 





1) 


6) 


(0 








1 





1 


1) 


V) 


(0 





1 


1 


1 


1 


0) 


8) 


(1 











1 


1 


0) 



The basis of X can be represented by a 3 x 7 matrix whose 
rows form a basis for the subspace. There are 168 different 
matrices for the 28 different basis. Many of these matrices are 
in row echelon form. One of them is 



10 10 11 
11110 
10 11 



Exactly one of these 168 matrices is in reduced row echelon 
form. 



E(X): 



10 110 
10 10 1 
10 11 



A Ferrers diagram represents partitions as patterns of dots 
with the i-th row having the same number of dots as the i- 
th term in the partition [15], [16], [17]. A Ferrers diagram 
satisfies the following conditions. 

• The number of dots in a row is at most the number of 
dots in the previous row. 

• All the dots are shifted to the right of the diagram. 
The number of rows (columns) of the Ferrers diagram JT is the 
number of dots in the rightmost column (top row) of JT. If the 
number of rows in the Ferrers diagram is m and the number 
of columns is -q we say that it is an ttt, x 77 Ferrers diagram. 

If we read the Ferrers diagram by columns we get another 
partition which is called the conjugate of the first one. If the 
partition forms an m x rj Ferrers diagram then the conjugate 
partition form an rj x m Ferrers diagram. 

Example 2: Assume we have the partition 6 + 5 + 5 + 3 + 2 
of 21. The 5x6 Ferrers diagram JF of this partition is given 
by 



The number of rows in JF is 5 and the number of columns is 
6. The conjugate partition is the partition 5 + 5 + 4 + 3 + 3 + 1 
of 21 and its 6 x 5 Ferrers diagram is given by 



Remark 1: Our definition of Ferrers diagram is slightly 
different from the usual definition [15], [16], [17], where the 
dots in each row are shifted to the left of the diagram. 

Each /c-dimensional subspace X of F" has an identifying 
vector v{X). v{X) is a binary vector of length n and weight fc, 
where the ones in v{X) are in the positions (columns) where 
E{X) has the leading ones (of the rows). 

Example 3: Consider the 3-dimensional subspace X of 
Example [U Its identifying vector is v{X) = 1011000. 

Remark 2: We can consider an identifying vector v{X) for 
some fc-dimensional subspace X as a characteristic vector of a 
fc-subset. This coincides with the definition of rank- and order- 
preserving map (f> from Qq{n, k) onto the lattice of subsets of 
an n-set, given by Knuth [14] and discussed by Milne [18]. 

The following lemma is easily observed. 

Lemma 1: Let X be a fc-dimensional linear subspace of F" 
v{X) its identifying vector, and ii,i2, ■ ■ ■ ,ik the positions in 
which v{X) has ones. Then for each nonzero element u E X 
the leftmost one in u is in position ij for some 1 < j < fc. 



Proof: Clearly, for each j, I < j < k, there exists 
an element Uj e X whose leftmost one is in position ij. 
Moreover, ui,U2, . . . ,Uk are linearly independent. Assume 
the contrary, that there exists an element u ^ X whose 
leftmost one is in position £ ^ {ii, . . . , ik}- This implies that 
u,ui,U2, ■ ■ ■ ,Uk are linearly independent and the dimension 
of X is at least fc + 1, a contradiction. ■ 

The following result will play an important role in the 
proof that our constructions for error-correcting codes in the 
projective space have the desired minimum distance. 

Lemma 2: If X and Y are two subspaces of Vq{n) 
with identifying vectors v{X) and v{Y), respectively, then 
ds{X,Y) > dH{v{X),v{Y)), where dH{u,v) denotes the 
Hamming distance between u and v. 

Proof: Let ii,...,v be the positions in which v{X) 
has ones and v{Y) has zeroes and ji, ....is be the positions 
in which v{Y) has ones and v{X) has zeroes. Clearly, r + 
s ~ dH{v{X),v{Y)). Therefore, by Lemma [T] X contains r 
linearly independent vectors ui, ...,Ur which are not contained 
in Y. Similarly, Y contains s linearly independent vectors 
which are not contained in X. Thus, ds{X,Y) > r + s — 
dH{v{X),v{Y)). m 

The echelon Ferrers form of a vector v of length n and 
weight fc, EF{v), is the kxn matrix in reduced row echelon 
form with leading entries (of rows) in the columns indexed 
by the nonzero entries of v and " • " in all entries which do 
not have terminals zeroes or ones. A " • " will be called in 
the sequel a dot. This notation is also given in [15], [17]. The 
dots of this matrix form the Ferrers diagram of EF{v). If we 
substitute elements of F^ in the dots of EF{v) we obtain a fc- 
dimensional subspace X of 'Pq{n). EF{v) will be called also 
the echelon Ferrers form of X. 

Example 4: For the vector v = 1001001, the echelon 
Ferrers form EF{v) is the following 3x7 matrix. 



EF{v) 



1 • • • • 
1 • • 
1 



EF{v) has the following 2x4 Ferrers diagram 



r-- 



Each binary word v of length n and weight fc corresponds 
to a unique kxn matrix in an echelon Ferrers form. There 
are a total of (^) binary vectors of length n and weight fc and 
hence there are (^^) different fc x n matrices in echelon Ferrers 
form. 

III. Ferrers Diagram Rank-Metric Codes 

In this section we start by defining the rank-metric codes. 
These codes are strongly connected to constant dimension 
codes by a lifting construction described by Silva, Kschis- 
chang, and Koetter [3]. We define a new concept which is 
a Ferrers diagram rank-metric code. Ferrers diagram rank- 
metric codes will be the main building blocks of our projective 



space codes. These codes present some questions which are 
of interest for themselves. 

For two m x r] matrices A and B over F^ the rank distance 
is defined by 

dniA, Bf=mnk{A - B) . 

A code C is an [m x r/, g, S] rank-metric code if its codewords 
are m x r] matrices over ¥q, they form a linear subspace of 
dimension g of F™^'', and for each two distinct codewords 
A and B we have that dii{A, B) > S. Rank-metric codes 
were well studied [12], [13], [19]. It was proved (see [13]) 
that for an [m x rj, g, S] rank-metric code C we have g < 
mm{m{ri — S + l),ri{m — 6 + 1)}. This bound is attained for 
all possible parameters and the codes which attain it are called 
maximum rank distance codes (or MRD codes in short). 

Let w be a vector of length n and weight k and let EF{v) 
be its echelon Ferrers form. Let T be the Ferrers diagram of 
EF{v). T is an m X f] Ferrers diagram, m < k, -q < n — k. 
A code C is an [T, g, S] Ferrers diagram rank-metric code if 
all codewords are m x 77 matrices in which all entries not in 
T are zeroes, it forms a rank-metric code with dimension g, 
and minimum rank distance 6. Let dim(jr, 5) be the largest 
possible dimension of an [!F, g, d] code. 

Theorem 1: For a given i,Q <i < 5 — 1, if Vi is the number 
of dots in T, which are not contained in the first i rows and 
are not contained in the rightmost 5 — 1 — i columns then 
mini{i/i} is an upper bound of dim(J^, 5). 

Proof: For a given i, Q < i < 5 — 1, \t\. Ai be the 
set of the Vi positions of JT which are not contained in the 
first i rows and are not contained in the rightmost 5 — 1 — i 
columns. Assume the contrary that there exists an [!F, Vi + 
1, S\ code C. Let B = {Bi, B2, ■ ■ . , B^^+i} be a set of i^,; + 
1 linearly independent codewords in C. Since the number of 
linearly independent codewords is greater than the number of 
entries in Ai there exists a nontrivial linear combination Y = 
Yl'iLi ctjBj for which the Vi entries of Ai are equal zeroes. 

Y is not the all-zeroes codeword since the B/s are linearly 
independent. !F has outside Ai exactly i rows and S — i ~ 1 
columns. These i rows can contribute at most i to the rank of 

Y and the S — i — 1 columns can contribute at most S — i — 1 to 
the rank of Y. Therefore F is a nonzero codeword with rank 
less than 6, a contradiction. 

Hence, an upper bound on dim(JF, S) is i^,; for each < 
i < S — 1. Thus, an upper bound on the dimension dim(jr, S) 
is minijfi}. ■ 

Conjecture 1: The upper bound of Theorem [T] is attainable 
for any given set of parameters q, T, and 8. 

If we use i = Oori = ^ — lin Theorem [T] we obtain the 
following result. 

Corollary 1: An upper bound on dim( J^. S) is the minimum 
number of dots that can be removed from T such that the 
diagram remains with at most (5—1 rows of dots or at most 
(5 — 1 columns of dots. 

Remark 3: [m x 7j, g, S] MRD codes are one class of Ferrers 
diagram rank-metric codes which attain the bound of Corol- 
lary [T] with equality. In this case the Ferrers diagram has m • r/ 
dots. 



Example 5: Consider the following Ferrers diagram 



We define an [m x 77, m{rj — S + 1),5] rank-metric code C, 
m = f] + e, derived from C as follows: 



:f = 



and (5 = 3. By Corollary [T] we have an upper bound, 
dim(jF, 3) < 2. But, if we use i = I in Theorem [T] then 
we have a better upper bound, dim(jF, 3) < 1. This upper 
bound is attained with the following generator matrix of an 
[J-, 1, 3] rank-metric code. 



/ 1 \ 

10 



\ 1 / 



When the bound of Theorem [T] is attained? We start with 
a construction of Ferrers diagram rank-metric codes which 
attain the bound of Corollary [T] Assume we have an to x 77, 
m — rj + e, e > 0, Ferrers diagram !F and that the minimum in 
the bound of Corollary [T] is obtained by removing all the dots 
from the rj — S+l leftmost columns of T. Hence, only the dots 
in the (5—1 rightmost columns will remain. We further assume 
that each of the S — I rightmost columns of JF have ni dots. 
The construction which follows is based on the construction 
of MRD g-cyclic rank-metric codes given by Gabidulin [12]. 

A code C of length m over F^m is called a q-cyclic code if 
(co,ci,...,c,„-i) eC implies that (c^_i, eg, ..., c^.j) G C. 

For a construction of [m x m, g, 5] rank-metric codes, we 
use an isomorphism between the field with q™ elements, F^™ , 
and the set of all m-tuples over ¥q, F™. We use the obvious 
isomorphism by the representation of an element a in the 
extension field Fgm as a = {ai, . . . ,am), where a^ is an 
element in the ground field Fg. Usually, we will leave to the 
reader to realize when the isomorphism is used as this will be 
easily verified from the context. 

A codeword c in an [m x m, g, S] rank-metric code C, can 
be represented by a vector c — (co, ci, . . . , Cm-i), where Ci G 
Fg.Ti and the generator matrix G of C is an /f x m matrix, 
g = mK. It was proved by Gabidulin [12] that if C is an 
MRD g-cyclic code then the generator polynomial of C is the 

Unearized polynomial G{x) = Yl 9i^'^ ^ where gi G Fgm, 

9m-K — 1, m — K + 5 — 1, and its generator matrix G has 
the form 



/ 



V 



9a 


Si 





9l 









9„^-K-l 



9o 





1 
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-K-1 


1 








g-K-1 


1 


/ 



Hence, a codeword c G C, c G (F^™)™, derived from the 
information word (ao, ai, . . . , a/<-_i), where ai G F^™, i.e. 
c = (flo, ai, . . . , aK-i)G, has the form 



C = {(co,ci, . 



1) : (0, 



, O,co,ci, 



c^-i)gC}. 



Remark 4: C is also an MRD code. 

We construct an [T, i, S] Ferrers diagram rank-metric code 
Cjr C C', where T is an m x rj Ferrers diagram. Let 7^, 
1 < i < ?/, be the number of dots in column i of T, where 
the columns are indexed from left to right. A codeword of Cj^ 
is derived from a codeword of c G C by satisfying a set of m 
equations implied by 



^aoffo,ao5i 




V 



a-ig^, 



■ , O.K-2 + O.K^ 



S-1 



\ 



h ■•■/. 



K- 



I 



(1) 

where /; = (•••■• •• ■ 0)^ is a column vector of length to, 

1 < i < K — e, and u^ denotes the transpose of the vector u. 

It is easy to verify that Cyr is a linear code. 

By ([TJ we have a system of m = K + 6 —1 equations with 

K variables, ao,ai, . . . , ax-i- The first e equations implies 

that Oi = for < i < £ - 1. The next K - e ^ rj - 5 + I 

equations determine the values of the a^'s, e < i < K ~ \, 

as follows. From the next equation ae^g = (•■•■• 00. ..0)^ 

71 m-71 
(this is the next equation after we substitute ai — Q for < 

i < e — 1), we have that a^ has q^^ solutions in F^m, where 

each element of F^m is represented as an r?T,-tuple over Fg. 

Given a solution of fle, the next equation fle^Q +0^+19^ — 

(••■•• 00... 0)-^ has q^^ solutions for a^+i. Therefore, we have 

72 m—^2 

that ao, fli, . . . , ttK-i have q^i=i '>'• solutions and hence the 
dimension of Cjr is J2i=i'^ li o^^r ^q- Note, that since each of 
the (5 — 1 rightmost columns of T have to dots, i.e. 7^ = m, 
K — e + l<i<ri (no zeroes in the related equations) it 
follows that any set of values for the a/s cannot cause any 
contradiction in the last (5—1 equations. Also, since the values 
of the K variables ag, ai, . . . , a^^i are determined for the last 
(5—1 equations, the values for the related ((5 — l)ni dots are 
determined. Hence they do not contribute to the number of 
solutions for the set of m equations. Thus, we have 

Theorem 2: Let JF be an to, x 77, 771 > 77, Ferrers diagram. 
Assume that each one of the rightmost (5—1 columns of T 
has m dots, and the i-th column from the left of JT has 7^ 
dots. Then Cjr is an [JT, Yll=i li^^] code which attains the 
bound of Corollary [T] 



Remark 5: For any solution of ao,ai 



(ao5o,ao5i + ai5o> 



, aK-2 + aK-l9l^_K-l^'^K-l) 



that (ao,ai, . 
and (co,ci,. 



.,aK_i)G = (0,. 

, C77-1) G Cjr. 



. ,0,co,ci, 



, ax~i we have 
,c,,_i) G C 



Remark 6: For any [mxr], ni{j]—S+l), S] rank-metric code 
C, the codewords which have zeroes in all the entries which 
are not contained in T form an [JF, J2l=i 7ii ^] code. Thus, 
we can use also any MRD codes, e.g. the codes described 
in [13], to obtain a proof for Theorem |2] 

Remark 7: Since Cy^ is a subcode of an MRD code then 
we can use the decoding algorithm of the MRD code for the 
decoding of our code. Also note, that if T is an m x j], m < rj, 
Ferrers diagram then we apply our construction for the rj xm 
Ferrers diagram of the conjugate partition. 

When S — 1 the bounds and the construction are trivial. If 
(5 = 2 then by definition the rightmost column and the top 
row of an m X 77 Ferrers diagram always has m dots and rj 
dots, respectively. It implies that the bound of Theorem [T] is 
always attained with the construction if S = 2. This is the 
most interesting case since in this case the improvement of 
our constant dimension codes compared to the codes in [2], 
[3] is the most impressive (see subsection llV-Cb . If 6 > 2 the 
improvement is relatively small, but we will consider this case 
as it is of interest also from a theoretical point of view. Some 
constructions can be given based on the main construction and 
other basic constructions. We will give two simple examples 
for (5 = 3. 

Example 6: Consider the following Ferrers diagram 



The upper bound on dim(JF, 3) is 3. It is attained with the 
following basis with three 4x4 matrices. 



/oioo\/oooi\/iooo\ 
0010 0100 0100 
0000 ' 0010 ' 0001 

\0001/\000O/\0001/ 
Example 7: Consider the following Ferrers diagram 



:f = 



The upper bound on dim(jr, 3) is 4. It is attained with the 
basis consisting of four 4x4 matrices, from which three are 
from Example |6] and the last one is 



/ 1 1 \ 

1 

10 1 

\ / 



As for more constructions, some can be easily generated 
by the interested reader, but whether the upper bound of 
Theorem [1] can be attained for all parameters remains an open 
problem. 



IV. Error-Correcting Constant Dimension Codes 

In this section we will describe our multilevel construction. 
The construction will be applied to obtain error-correcting 
constant dimension codes, but it can be adapted to construct 
error-correcting projective space codes without any modifi- 
cation. This will be discussed in the next section. We will 
also consider the parameters and decoding algorithms for 
our codes. Without loss of generality we will assume that 
k < n — k. This assumption can be made as a consequence of 
the following lemma [4], [11]. 

Lemma 3: If C is an {n, M, 2(5, k)q constant dimension 
code then C^ = {X^ : X G C}, where X-^ is the orthogonal 



subspace of X, is an (n, M, 26, n 
code. 



k)q constant dimension 



A. Lifted codes 

Koetter and Kschischang [2] gave a construction for con- 
stant dimension Reed-Solomon like codes. This construc- 
tion can be presented more clearly in terms of rank-metric 
codes [3]. Given an [k x {n — k), g, S] rank-metric code C we 
form an {n,q^,2S,k)q constant dimension code C by lifting 
C, i.e., C = {[/fc A] : Ae C}, where Ik is the /c x fc identity 
matrix [3]. We will call the code C the lifted code of C. Usually 
C is not maximal and it can be extended. This extension 
requires to design rank-metric codes, where the shape of a 
codeword is a Ferrers diagram rather than an A: x [n—k) matrix. 
We would like to use the largest possible Ferrers diagram rank- 
metric codes. In the appropriate cases, e.g. when (5 = 2, we 
will use the codes constructed in Section |lll] for this purpose. 

Assume we are given an echelon Ferrers form EF{v) of 
a binary vector v, of length n and weight k, with a Ferrers 
diagram T and a Ferrers diagram rank-metric code Cjr. Cjf is 
lifted to a constant dimension code C„ by substituting each 
codeword A £ Cjr in the columns of EF{v) which correspond 
to the zeroes of v. Note, that depending on JT it might implies 
conjugating T first. Unless v starts with an one and ends with 
a zero (the cases in which JF is a fc x (?i — fc) Ferrers diagram) 
we also need to expand the matrices of the Ferrers diagram 
rank-metric code to kx {n — k) matrices (which will be lifted), 
where J^ is in their upper right corner (and the new entries 
are zeroes). As an immediate consequence from [3] we have. 

Lemma 4: If Cjf is an [JT, g, S\ Ferrers diagram rank-metric 
code then its lifted code C„, related to an fc x n echelon Ferrers 
form EF[v), is an (n, g^, 2(5, k)q constant dimension code. 



Example 8: For the word v 
form 



1110000, its echelon Ferrers 



EF{v) 



the 3x4 matrix 



10 
10 
1 



10 10 

0001 
0000 



is lifted to the 3-dimensional subspace with the 3x7 generator 
matrix 



For the word v = 1001001, its echelon Ferrers form 

EF{v) = 







the 2 X 4 matrix 



is lifted to the 3-dimensional subspace with the 3x7 generator 
matrix 

"iiooioo 
10 10 
1 



Theorem 3: C is an (n, M, 25, k)q constant dimension 

code, where M = J^cec l*^c|- 

Example 9: Let n = 6, fc = 3, and C = 
{111000, 100110, 010101, 001011} a constant weight code 
of length 6, weight 3, and minimum Hamming distance 4. The 
echelon Ferrers forms of these 4 codewords are 





1 








= 





1 













1 



£;f(iooiio) : 



1 • • 
10 
1 



The code described in [3] is the same as the code described 
in Section |III1 where the identifying vector is (1 • • • 10 • • • 0). 
If our lifted codes are the codes described in Section |lll] then 
the same decoding algorithm can be applied. Therefore, the 
decoding in [3] for the corresponding constant dimension code 
can be applied directly to each of our lifted constant dimension 
codes in this case, e.g. it can always be applied when S — 2.1t 
would be worthwhile to permute the coordinates in a way that 
the identity matrix I^ will appear in the first k columns, from 
the left, of the reduced row echelon form, and T will appear 
in the upper right corner of the k x n matrix. The reason is 
that the decoding of [3] is described on such matrices. 



B. Multilevel construction 



{n, M, 2(5, fc). 



constant 



Assume we want to construct an 
dimension code C 

The first step in the construction is to choose a binary 
constant weight code C of length n, weight k, and minimum 
distance 26. This code will be called the skeleton code. Any 
constant weight code can be chosen for this purpose, but differ- 
ent skeleton codes will result in different constant dimension 
codes with usually different sizes. The best choice for the 
skeleton code C will be discussed in the next subsection. The 
next three steps are performed for each codeword c G C. 

The second step is to construct the echelon Ferrers form 
EF{c). 

The third step is to construct an [T, g, S] Ferrers diagram 
rank-metric code Cjr for the Ferrers diagram T of EF{c). If 
possible we will construct a code as described in Section |III] 

The fourth step is to lift Cjr to a constant dimension code 
Cc, for which the echelon Ferrers form of X G Cc is EF{c). 

Finally, 

C = U Cc . 
cec 

As an immediate consequence of Lemmas |2] and S] we have 
the following theorem. 



£;f{oioioi) : 



EF(OOlOll) : 



1 • • 
1 • 
1 



1*00 
10 
1 



By Theorem |2] the Ferrers diagrams of these four echelon 
Ferrers forms yield Ferrers diagram rank-metric codes of sizes 
64, 4, 2, and 1, respectively. Hence, we obtain a (6, 71, 4, 3)2 
constant dimension code C 



Remark 8: A (6, 74, 4, 3)2 code was obtained by computer 
search [8]. Similarly, we obtain a (7,289,4,3)2 code. A 
(7, 304, 4, 3)2 code was obtained by computer search [8]. 

Example 10: Let C be the codewords of weight 4 in the 
[8,4,4] extended Hamming code with the following parity- 
check matrix. 



00001111 
00110011 
01010101 

11111111 



C has 14 codewords with weight 4. Each one of these 
codewords is considered as an identifying vector for the 
echelon Ferrers forms from which we construct the final 
(8, 4573, 4, 4)2 code C. The fourteen codewords of C and their 
contribution for the final code C are given in the following 
table. The codewords are taken in lexicographic order. 





ceC 


size of Cc 


1 


11110000 


4096 


2 


11001100 


256 


3 


11000011 


16 


4 


10101010 


64 


5 


10100101 


16 


6 


10011001 


16 


7 


10010110 


16 


8 


01101001 


32 


9 


01100110 


16 


10 


01011010 


16 


11 


01010101 


8 


12 


00111100 


16 


13 


00110011 


4 


14 


00001111 


1 



C. Code parameters 

We now want to discuss the size of our constant dimension 
code, the required choice for the skeleton code C, and compare 
the size of our codes with the size of the codes constructed 
in [2], [3]. 

The size of the final constant dimension code C depends 
on the choice of the skeleton code C. The identifying vector 
with the largest size of corresponding rank-metric code is 
1---10---G. The corresponding [fc x (n — fc),i?,(5] rank- 



k n — k 

metric code 



has dimension £ 



hence it contributes g*^" ''')'''' ^^^ 



{n - k){k -5+1) and 
' fc-dimensional subspaces 
to our final code C These subspaces form the codes in [2], 
[3]. The next identifying vector which contributes the most 
number of subspaces to C is 11. ..10 • • • Oil. ..1000. ..GO. The 

k—5 S 5 n—k~5 

number of subspaces it contributes depends on the bounds 
presented in Sectionlllll The rest of the code C usually has less 
codewords from those contributed by these two. Therefore, the 
improvement in the size of the code compared to the code 
of [2] is not dramatic. But, for most parameters our codes are 
larger than the best known codes. In some cases, e.g. when 
5 = k our codes are as good as the best known codes (see [4]) 
and suggest an alternative construction. When k — i, 5 — A, 
and reasonably small n, the cyclic codes constructed in [4], 
[8] are larger. 

Two possible alternatives for the best choice for the skeleton 
code C might be of special interest. The first one is for k — 4 
and n which is a power of two. We conjecture that the best 
skeleton code is constructed from the codewords with weight 
4 of the extended Hamming code for which the columns of 
the parity-check matrix are given in lexicographic order. We 
generalize this choice of codewords from the Hamming code 
by choosing a constant weight lexicode [20]. Such a code is 
constructed as follows. All vectors of length n and weight k 
are listed in lexicographic order The code C is generated by 
adding to the code C one codeword at a time. At each stage, 
the first codeword of the list that does not violate the distance 
constraint with the other codewords of C, is joined to C. 
Lexicodes are not necessarily the best constant weight codes. 



For example, the largest constant weight code of length 10 and 
weight 4 is 30, while the lexicode with the same parameters 
has size 18. But, the constant dimension code derived from the 
lexicode is larger than any constant dimension code derived 
from any related code of size 30. 

The following table summarized the sizes of some of our 
codes compared to previous known codes. In all these codes 
we have started with a constant weight lexicode in the first 
step of the construction. 



q 


ds{C) 


n 


k 


code size [2] 


size of our code 


2 


4 


9 


4 


2i5 


2^'V4177 


2 


4 


10 


5 


2^0 


2^"h-118751 


2 


4 


12 


4 


224 


2^4+2290845 


2 


6 


10 


5 


2i5 


2^^H-73 


2 


6 


13 


4 


2i8 


2^«-h4357 


2 


8 


21 


5 


232 


2^^+16844809 


3 


4 


7 


3 


3« 


3**H-124 


3 


4 


8 


4 


3^^ 


3^"-h8137 


4 


4 


7 


3 


4« 


4«H-345 


4 


4 


8 


4 


4i2 


4^^H-72529 



D. Decoding 

The decoding of our codes is quite straightforward and it 
mainly consists of known decoding algorithms. As we used a 
multilevel coding we will also need a multilevel decoding. In 
the first step we will use a decoding for our skeleton code and 
in the second step we will use a decoding for the rank-metric 
codes. 

Assume the received word was a fc-dimensional subspace 
Y. We start by generating its reduced row echelon form 
E{Y). Given E{Y) it is straightforward to find the identifying 
vector v{Y). Now, we use the decoding algorithm for the 
constant weight code to find the identifying vector v{X) of the 
submitted fc-dimensional subspace X. If no more than 6 — 1 
errors occurred then we will find the correct identifying vector. 
This claim is an immediate consequence of Lemma |2] 

In the second step of the decoding we are given the received 
subspace Y, its identifying vector v{Y), and the identifying 
vector v{X) of the submitted subspace X. We consider the 
echelon Ferrers form EF{v{X)), its Ferrers diagram JT, and 
the [T, g, 6] Ferrers diagram rank-metric code associated with 
it. We can permute the columns of EF{v{X)), and use the 
same permutation on Y, in a way that the identity matrix Ik 
will be in the left side. Now, we can use the decoding of 
the specific rank-metric code. If our rank-metric codes are 
those constructed in Section Hill then we can use the decoding 
as described in [3]. It is clear now that the efficiency of 
our decoding depends on the efficiency of the decoding of 
our skeleton code and the efficiency of the decoding of our 
rank-metric codes. If the rank-metric codes are MRD codes 
then they can be decoded efficiently [12], [13]. The same is 
true if the Ferrers diagram metric codes are those constructed 
in Section [III] as they are subcodes of MRD codes and the 
decoding algorithm of the related MRD code applied to them 
too. 

There are some alternative ways for our decoding, some 
of which improve on the complexity of the decoding. For 



example we can make use of the fact that most of the code 
is derived from two identifying vectors or that most of the 
rank-metric codes are of relatively small size. One such case 
can be when all the identity matrices of the echelon Ferrers 
forms are in consecutive columns of the codeword (see [10]). 
We will not discuss it as the related codes hardly improve on 
the codes in [2], [3]. 

Finally, if we allow to receive a word which is an i- 
dimensional subspace Y, fc — (5 + 1 < i < k + S — 1, then 
the same procedure will work as long as ds{X, Y) < S — 1. 
This is a consequence of the fact that the decoding algorithm 
of [3] does not restrict the dimension of the received word. 

V. Error-Correcting Projective Space Codes 

In this section our goal will be to construct large codes in 
Pq{n) which are not constant dimension codes. We first note 
that the multilevel coding described in Section |IV] can be used 
to obtain a code in Pq{n). The only difference is that we should 
start in the first step with a general binary code of length n 
in the Hamming space as a skeleton code. The first question 
which will arise in this context is whether the method is as 
good as for constructing codes in Sj(n, k). The answer can be 
inferred from the following example. 

Example 11: Let n = 7 and d = 3, and consider the [7,4,3] 
Hamming code with the parity-check matrix 



1111 
110 110 
10 110 10 



By using the multilevel coding with this Hamming code we 
obtain a code with minimum distance 3 and size 394 in 7^2(7). 

As we shall see in the sequel this code is much smaller 
than a code that will be obtained by puncturing. We have also 
generated codes in the projective space based on the multilevel 
construction, where the skeleton code is a lexicode. The 
constructed codes appear to be much smaller than the codes 
obtained by puncturing. Puncturing of a code C (or union of 
codes with different dimensions and the required minimum 
distance) obtained in Section |IV] results in a projective space 
code C. If the minimum distance of C is 2(5 then the minimum 
distance of C is 25 — 1. C has a similar structure to a code 
obtained by the multilevel construction (similar structure in 
the sense that the identifying vectors of the codewords can 
form a skeleton code). But the artificial "skeleton code" can 
be partitioned into pair of codewords with Hamming distance 
one, while the distance between two codewords from different 
pairs is at least 26—1. This property yields larger codes by 
puncturing, sometimes with double size, compared to codes 
obtained by the multilevel construction. 

A. Punctured codes 

Puncturing and punctured codes are well known in the 
Hamming space. An (n, M, d) code in the Hamming space 
is a code of length n, minimum Hamming distance d, and M 
codewords. Let C be an {n, M, d) code in the Hamming space. 



Its punctured code C is obtained by deleting one coordinate 
of C. Hence, there are n punctured codes and each one is an 
(n — 1, M, d—1) code. In the projective space there is a very 
large number of punctured codes for a given code C and in 
contrary to the Hamming space the sizes of these codes are 
usually different. 

Let X be an i?-subspace of F" such that the unity vector 
with an one in the i-th coordinate is not an element in X. 
The i-coordinate puncturing of X, Ai{X), is defined as the 
^-dimensional subspace of F"^^ obtained from X by deleting 
coordinate i from each vector in X. This puncturing of a 
subspace is akin to puncturing a code C in the Hamming 
space by the i-th coordinate. 

Let C be a code in Vq{n) and let Q be an {n — 1)- 
dimensional subspace of F". Let E{Q) be the (n — 1) x n 
generator matrix of Q (in reduced row echelon form) and let 
T be the position of the unique zero in v{Q). Let v G F" be 
an element such that v ^ Q. We define the punctured code 



where 



and 



'^'q.v = '^Q UCq,!, 



Cq = {Ar{X) : X eC, XCQ}) 



,,« = {A^(x n Q) : X e C, V e X} 



Remark 9: If C was constructed by the multilevel construc- 
tion of Section IrVl then the codewords of Cg and Cq„ can be 
partitioned into related lifted codes of Ferrers diagram rank- 
metric codes. Some of these codes are cosets of the linear 
Ferrers diagram rank-metric codes. 

The following theorem can be easily verified. 

Theorem 4: The punctured code Cg ^ of an {n,M,d)q 
code C is an (n — 1, M', d — l)g code. 

Remark 10: The code C = {X : X E C, X C Q}) U 
{X nQ : X e C, V e X} is cin (n, M',d-l)q code whose 
codewords are contained in Q. Since Q is an (n — 1)- 
dimensional subspace it follows that there is an isomorphism ip 
such that ifiQ) = F^-^. The code ^{C) = {(^(X) : X e C} 
is an {n — 1, M' , d — \)q code. The code Cq ^ was obtained 
from C by such isomorphism which uses the r-coordinate 
puncturing on all the vectors of Q. 

Example 12: Let C be the (8,4573,4,4)2 code given in 
Example [TOl Let Q be the 7-dimensional subspace whose 7x8 
generator matrix is 



(I 
1 



V 



\ 




1 0/ 



By using puncturing with Q and v — 1000001 we obtain a 
code Cq „ with minimum distance 3 and size 573. By adding 
to Cq ^ two codewords, the null space {0} and F2 we obtained 
a (7,575,3)2 code in V2{7)- The following tables show the 
number of codewords which were obtained from each of the 
identifying vectors with weight 4 of Example [TO] 



Cq 


identifying vector 


addition to Cq 


11110000 


256 


11001100 


16 


10101010 


8 


10010110 


2 


01100110 


4 


01011010 


2 


00111100 


1 




Cq,„ V ^ 10000001 


identifying vector 


addition to Cq,.(, 


11110000 


256 


11001100 


16 


11000011 


1 


10101010 


4 


10100101 


2 


10011001 


4 


10010110 


1 



The Ferrers diagram rank-metric codes in some of the entries 
must be chosen (if we want to construct the same code by 
a multilevel construction) in a clever way and not directly as 
given in Section |III] We omit their description from lack of 
space and leave it to the interested reader 

The large difference between the sizes of the codes of 
Examples [TT| and [12] shows the strength of puncturing when 
applied on codes in Vq{n). 

B. Code parameters 

First we ask, what is the number of codes which can be 
defined in this way from C? Q is an [n — 1) -dimensional 
subspace of F" and hence it can be chosen in '^„ ~/ different 

ways. There are 



9-1 



9-1 



= g" distinct way to choose 
V ^ Q after Q was chosen. Thus, we have that usually 

271-1 „n-l 

puncturing of a code C in 'Pq{n) will result in 2- 
different punctured codes. 

Theorem 5: If C is an (n, ALd,k)„ code then there ex- 



9-1 



ists an (n — 

M(g"~'°+g'°-2) 



l,M',d - l)g code Cq„ such that M' > 



Proof: As before, Q can be chosen in ^—-f- differ- 
ent ways. By using basic enumeration, it is easy to verify 
that each /c-dimensional subspace of Vq{n) is contained in 
- — ^^ {n — 1) -dimensional subspaces of 'Pq{n). Thus, by a 
simple averaging argument we have that there exists an (n— 1)- 
dimensional subspace Q such that |Cq| > M^ 



q^-l ■ 

There are M— ICnl codewords in C which are not contained 



in Q. For each such codeword X G C we have dm\{X r\Q) — 
fc — 1. Therefore, X contains q^ — q^^^ vectors which do not 
belong to Q. In F" there are g" — q"^^ vectors which do not 
belong to Q. Thus, again by using simple averaging argument 
we have that there exist an (n — l)-dimensional subspace 
Q C F'^' and u ^ g such that |Cq,„| > ^'''^''^gi^^f-T'''"'^ = 

M-|Co| 



Therefore, there exists an [n — l,M',d — l)q code Cq ^ 



such that M' 



(9" 



Zq\+M 



> 



(9" 



> 

-1)M(9^ 



(,--l)g" 



|Cq|9"''°+M- 

qn-k 

-1)+M(g" 



M(9"~'°+g'°-2) 



Clearly, choosing the (n — 1) -dimensional subspace Q and 
the element w in a way that C'g ^ will be maximized is 
important in this context. Example [12] can be generalized in a 
very simple way. We start with a (4fc, g2fe(fc+i)^ 2fc^ 2k)q code 
obtained from the codeword 1 • • ■ 10 • • • in the multilevel 

2fe 2k 

approach. We apply puncturing with the (n — 1) -dimensional 
subspace Q whose (n — 1) x n generator matrix is 



/I 




V 



\ 




1 o; 



It is not difficult to show that in the [{2k) x {2k),2k{k + 
1), fc] rank-metric code C there are q'^'^ codewords with zeroes 
in the last column and q'^'^' codewords with zeroes in the first 
row. There is also a codeword whose first row ends with a 
one. If u is this first row which ends with a one there are 
(jr^fc codewords whose first row is u. We choose v to be v = 
10 • • • Gu. By using puncturing with Q and v we have 



,2fe^ 



2k- 



q- and |Cg,„| = q^fc . Hence, C^_„ is a (4fc-l,2g^ 
l)g code in Vq{^k — 1). By using more codewords from the 
constant weight code in the multilevel approach and adding 
the null space and F^'^'^^ to the code we construct a slightly 
larger code with the same parameters. 



C. Decoding 

We assume that C is an (n, M, d)q code and that all the 
dimensions of the subspaces in C have the same parity which 
implies that d = 26. This assumption makes sense as these 
are the interesting codes on which puncturing is applied, 
similarly to puncturing in the Hamming space. We further 
assume for simplicity that w.l.o.g. if E{Q) is the (n — 1) x n 
generator matrix of Q then the first n — 1 columns are linearly 
independent, i.e., E{Q) = [/ u], where / is an (n— 1) x {n—1) 
unity matrix and m is a column vector of length n — 1. 

Assume that the received word from a codeword X' of Cg „ 
is an ^^-dimensional subspace Y' of F"~^. The first step will 
be to find a subspace Z of F" on which we can apply the 
decoding algorithm of C The result of this decoding will 
reduced to the (n — 1) -dimensional subspace Q and punctured 
to obtain the codeword of Cq „ . We start by generating from 
Y' an ^-dimensional subspace F C Q of F" This is done by 
appending a symbol to the end of each vector in Y' by using 
the generator matrix E{Q) of Q. If a generator matrix E{Y') 
is given we can do this process only to the rows of E{Y') to 
obtain the generator matrix E{Y) of Y . We leave to the reader 
the verification that the generator matrix of Y is formed in its 
reduced row echelon form. 



10 



Remark 11: If the zero of v{Q) is in coordinate r then 
instead of appending a symbol to the end of the codeword we 
insert a symbol at position r. 

Let I be the dimension of Y' and assume p is the parity of 
the dimension of any subspace in C, where p — Q or p — 1. 
Once we have Y we distinguish between two cases to form a 
new subspace Z of F" 
Case 1: (5 is even. 

. lf(.=p [mod 2) then Z = Ylj{v + Y). 

. Ifi^p {mod 2) then Z = Y. 
Case 2: 6 is odd. 

. lfl = p [mod 2) then Z = Y. 

. Ifi^p {mod 2) then Z ^Y\j{v + Y). 

Now we use the decoding algorithm of the code C with the 
word Z. The algorithm will produce as an output a codeword 
X.htt X ^ Xr\Q and X' be the subspace of F^'^^ obtained 
from X by deleting the last entry of X. We output X' as 



the submitted codeword X' of 



^Q-.-v 



The correctness of the 



decoding algorithm is an immediate consequence from the 
following theorem. 

Theorem 6: If ds{X' , Y')<5-1 then X' = X' . 

Proof: Assume that ds{X' , Y') < 5 - 1. Let X C Q be 
the word obtained from X' by appending a symbol to the end 
of each vector in X' (this can be done by using the generator 
matrix E{Q) of Q). \f u ^ X' r\Y' then we append the same 
symbol to u to obtain the element of X and to obtain the 
element of Y. Hence, ds{X,Y) = ds{X' ,Y') < 5 - I. If 
dsiX,Y) <S -2 then ds{X,Z) < ds{X,Y) + 1 < S - I. 
Now, note that if (5 — 1 is odd then Z does not have the same 
parity as the dimensions of the subspaces in C and if J — 1 
is even then Z has the same parity as the dimensions of the 
subspaces in C Therefore, if ds{X, Y) = S — 1 then by the 
definition of Z we have Z = Y and hence ds{X, Z) = 5 —\. 
Therefore, the decoding algorithm of C will produce as an 
output the unique codeword X such that ds{X ,Z) < S — 1, 
i.e., X — X. X' is obtained by deleting the last entry is each 
vector of X nQ; X' is obtained by deleting the last entry is 
each vector of A:" n Q. Therefore, X' ^ X' . ■ 

The constant dimension codes constructed in Section |IV] 
have the same dimension for all codewords. Hence, if C was 
constructed by our multilevel construction then its decoding 
algorithm can be applied on the punctured code Cq ^ . 

VI. Conclusion and Open Problems 

A multilevel coding approach to construct codes in the 
Grassmannian and the projective space was presented. The 
method makes usage of four tools, an appropriate constant 
weight code, the reduced row echelon form of a linear sub- 
space, the Ferrers diagram related to this echelon form, and 
rank-metric codes related to the Ferrers diagram. Some of 
these tools seem to be important and interesting for themselves 
in general and particularly in the connection of coding in the 
projective space. The constructed codes by our method are 
usually the best known today for most parameters. We have 
also defined the puncturing operation on codes in the projective 
space. We applied this operation to obtain punctured codes 
from our constant dimension codes. These punctured codes 



are considerably larger than codes constructed by any other 
method. The motivation for considering these codes came 
from network coding [21], [22], [23] and error-correction in 
network coding [2], [24], [25]. It worth to mention that the 
actual dimensions of the error-correcting codes needed for 
network coding are much larger than the dimensions given 
in our examples. Clearly, our method works also on much 
higher dimensions as needed for the real application. 

The research on coding in the projective space is only in its 
first steps and many open problems and directions for further 
research are given in our references. We focus now only in 
problems which are directly related to our current research. 

• Is there a specification for the best constant weight code 
which should be taken for our multilevel approach? Our 
discussion on the Hamming code and lexicodes is a first 
step in this direction. 

• Is the upper bound of Theorem [T] attained for all pa- 
rameters? Our constructions for optimal Ferrers diagram 
rank-metric codes suggest that the answer is positive. 

• How far are the codes constructed by our method from 
optimality? The upper bounds on the sizes of codes in the 
Grassmannian and the projective space are still relatively 
much larger than the sizes of our codes [4], [2], [11]. The 
construction of cyclic codes in [4] suggests that indeed 
there are codes which are relatively much larger than our 
codes. But, we believe that in general the known upper 
bounds on the sizes of codes in the projective space are 
usually much larger than the actual size of the largest 
codes. Indeed, solution for this question will imply new 
construction methods for error-correcting codes in the 
projective space. 
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